17.8 Metagenomics

267

organisms, from bacteria to man. Later, the sequence of the small subunit of ribosomal

RNA (rRNA), another essential and universal object, was used. 27 Nowadays, one can,

in principle, analyse whole genomes.

A chronology can be established on the premiss that the more changes there are, the

longer the time elapsed since the species diverged (assuming that the changes occur

at a constant rate with respect to sidereal time). This premiss can be criticized since,

although the unit of change is the nucleotide, selection (the engine of speciation) acts

on the amino acid; some nucleotide mutations lead to no change in amino acid due

to the degeneracy of the code. There is actually little real evidence that mutations

occur at random (i.e., with respect to both the site and the type of mutation).

A difficulty with molecular phylogenies is the fact that lateral gene transfer (LGT;

cf. Sect. 14.7.4), especially between bacteria and between archaea, may vitiate the

calculated distances. A plausible counterargument in favour of the use of rRNA is

that it should be unaffected by LGT, due to its fundamental place in cell metabolism.

A further difficulty is a computational one: that of finding the optimal tree, since

usually one is interested in comparing dozens (and ultimately millions) of species.

The basic principle applied to address this problem is that of parsimony: One seeks to

construct the tree with the least possible number of evolutionary steps. Unfortunately,

this is an NP-complete problem and hence the computation time grows exponentially

with the number of species; even a mere 20 species demands the analysis of almost

10 Superscript 221022 possible trees!

17.8

Metagenomics

The ability to culture bacteria in the laboratory, of which Pasteur seems to have been

the pioneer, was a crucial step in the emergence of bacteriology. Pasteur used liquid

media (broths); for the purposes of investigation, however, solid media, introduced

by Koch, are more convenient. Since then, the culture of bacteria has become indis-

pensable to vast areas of medicine, biotechnology, and research, for the identification

and counting of bacteria and for the development of serological assays and vaccines,

to name just a few ways of making use of bacterial cultures. At the same time, it

is recognized that the vast majority of bacteria cannot be cultured. Therefore, the

only way that this extant, almost immanent microbial richness, can be accessed is

by sequencing its genomic signature. The genomes of the entire natural microbiota

collectively constitute the metagenome. 28

The vast increase in DNA sequencing capability—both in terms of hardware and

in algorithms for analysing the raw data—has allowed metagenomics (the study of the

metagenome) to become a practical science. Work begins with the extraction of the

DNA of all the microbes in some environmental sample (e.g., soil, 29 or seawater, or

27 rRNA has been championed by C. Woese (2000).

28 Rondon et al. (2000); Committee on Metagenomics (2007).

29 Ritz (2008).